Description

NONLINEAR OVERLAP METHOD FOR TIME SCALING

Background of Invention 
[0001] 1. Field of the Invention

[0002] The present invention relates to a signal-synthesizing 
method, and more particularly, to a nonlinear overlap 
method for time scaling. 

[0003] 2. Description of the Prior Art 

[0004] Due to the dramatic progress in electronic technologies,
an AV player such as a karaoke machine can provide more and
more functions, such as audio clean-up, dynamic
repositioning of enhanced audio and music (DREAM), and
time scaling. Time scaling (also called time stretching,
time compression/expansion, or time correction) is a
function that elongates or shortens an audio signal while
keeping the pitch of the audio signal approximately
unchanged. In short, time scaling adjusts only the tempo of
an audio signal.

[0005] In general, an AV player performs time scaling with one of
the three following methods: Phase Vocoder, Minimum
Perceived Loss Time Expansion/Compression (MPEX), and
Time Domain Harmonic Scaling (TDHS). Phase Vocoder
transforms an audio signal into a complex Fourier
representation signal with the Short Time Fourier Transform (STFT)
and then transforms the complex Fourier representation
signal back into a time-scaled audio signal corresponding
to the original audio signal with interpolation techniques
and the inverse STFT (iSTFT). MPEX is a method researched
and developed by Prosoniq for simulating characteristics
of human hearing, similar to an artificial neural
network. MPEX records audio signals received for a
predetermined period and tries to "learn" the audio signals, so
as to either elongate or shorten the audio signals. TDHS is
one of the most popular methods for time scaling. TDHS
first establishes an autocorrelogram of a first audio signal,
the autocorrelogram consisting of a plurality of magnitudes,
then delays the first audio signal by a maximum index,
that is, the index corresponding to the largest magnitude
among all of the magnitudes of the autocorrelogram,
to form a second audio signal, and lastly
synchronizes and overlap-adds (SOLA) the first audio signal
to the second audio signal to form a third audio signal
longer than the first audio signal.
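
The lag search that TDHS relies on can be illustrated with a minimal Python sketch. It assumes an unnormalized autocorrelation and skips lag 0, which would otherwise always hold the largest magnitude. The names `autocorrelogram` and `maximum_index` and the test tone are illustrative and are not taken from the prior art; the overlap-add step is not shown.

```python
import math

def autocorrelogram(signal, max_lag):
    """Unnormalized autocorrelation magnitude for each lag in 1..max_lag."""
    n = len(signal)
    return {
        lag: sum(signal[i] * signal[i + lag] for i in range(n - lag))
        for lag in range(1, max_lag + 1)
    }

def maximum_index(signal, max_lag):
    """Lag whose magnitude is the largest in the autocorrelogram."""
    magnitudes = autocorrelogram(signal, max_lag)
    return max(magnitudes, key=magnitudes.get)

# Hypothetical test tone: a sine wave with a period of 8 samples.
tone = [math.sin(2 * math.pi * i / 8) for i in range(64)]
lag = maximum_index(tone, 32)
```

For this tone the search returns the period of the sine wave, which is the delay at which the signal best lines up with a copy of itself.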
[0006] In a computer system, the autocorrelogram is usually
established by a digital signal processing (DSP) chip
designed to manage complex mathematical calculations such
as convolution and the fast Fourier transform (FFT). However,
the process by which the DSP chip synthesizes the third audio
signal from the first and second audio signals is tedious
and sometimes unnecessary.
Summary of Invention 

[0007] It is therefore a primary objective of the claimed invention
to provide a nonlinear overlap method for time scaling that
efficiently synthesizes a third audio signal from a first
audio signal and a second audio signal without dramatically
sacrificing the quality of the third audio signal.

[0008] According to the claimed invention, the nonlinear overlap
method for time scaling to synthesize an S3[n] signal from
an S1[n] signal and an S2[n] signal, the S1[n] signal having
N1 elements and the S2[n] signal having N2 elements,
comprises:

[0009] (a) delaying the S2[n] signal by a predetermined number of
elements and forming an S5[n] signal;

[0010] (b) establishing a cross-correlogram of a cross-correlation
function of the S1[n] signal and the S5[n] signal, the
cross-correlogram including a plurality of magnitudes, each of
the magnitudes corresponding to an index; and

[0011] (c) setting the S3[n] signal as values of the elements of:

[0012] S1[n], where 0 <= n < (the predetermined number + a
first threshold value + a maximum index), the maximum
index corresponding to a largest magnitude among all of
the magnitudes of the cross-correlogram;

[0013] S1[n] weighted and added to an S4[n] signal that lags the
S5[n] signal by the maximum index, where (the predetermined
number + the first threshold value + the maximum
index) <= n < (N1 - a second threshold value); and

[0014] S4[n - (the predetermined number + the maximum index)],
where (N1 - the second threshold value) <= n <= (N2 +
the predetermined number + the maximum index);

[0015] wherein the first and second threshold values are not
equal to zero at the same time.
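
As a compact restatement, step (c) may be written as a piecewise definition. The symbols are shorthand introduced here for readability: A is the predetermined number, th_1 and th_2 are the first and second threshold values, and \tau_{\max} is the maximum index. The weights w_1[n] and w_4[n] are placeholders, since the summary does not state the weighting.

```latex
S_3[n] =
\begin{cases}
S_1[n], & 0 \le n < A + th_1 + \tau_{\max} \\[4pt]
w_1[n]\,S_1[n] + w_4[n]\,S_4[n], & A + th_1 + \tau_{\max} \le n < N_1 - th_2 \\[4pt]
S_4\bigl[n - (A + \tau_{\max})\bigr], & N_1 - th_2 \le n \le N_2 + A + \tau_{\max}
\end{cases}
\qquad \text{with } (th_1, th_2) \ne (0, 0)
```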

[0016] It is an advantage of the claimed invention that the
method calculates only the values between the first threshold
and the second threshold, instead of all values of the
overlapped signal from start to finish, which saves time for a
DSP chip synthesizing the S3[n] signal from the S1[n] and S2[n]
signals and improves the performance of a computer in which
the DSP chip is installed.
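
The saving can be made concrete with a small Python calculation. It counts the elements that fall in the weighted-and-added range, first with both thresholds at zero (the whole overlap) and then with nonzero thresholds. The function name and all numeric values are hypothetical and chosen only for illustration.

```python
def weighted_sample_count(n1, offset_a, tau_max, th1=0, th2=0):
    """Count the n with offset_a + th1 + tau_max <= n < n1 - th2."""
    return max(0, (n1 - th2) - (offset_a + th1 + tau_max))

# Hypothetical figures: N1 = 60, predetermined number 20, maximum index 4.
full_overlap = weighted_sample_count(60, 20, 4)           # both thresholds zero
confined = weighted_sample_count(60, 20, 4, th1=5, th2=5)
saved = full_overlap - confined
```

With these figures the weighted region shrinks from 36 elements to 26, so 10 weighted additions are avoided for each pair of signals.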

[0017] These and other objectives of the claimed invention will
no doubt become obvious to those of ordinary skill in the
art after reading the following detailed description of the
preferred embodiment that is illustrated in the various
figures and drawings.
Brief Description of Drawings 

[0018] Fig. 1 is a flow chart of a method according to the present
invention.

[0019] Fig. 2 is a schematic diagram demonstrating how the
method synthesizes an S3[n] signal from an S1[n] signal
and an S2[n] signal according to the present invention.

[0020] Fig. 3 is a schematic diagram demonstrating how the
method elongates an audio signal according to the
present invention.

[0021] Fig. 4 is a schematic diagram demonstrating how the
method shortens an audio signal according to the present
invention.
Detailed Description 

[0022] After establishing an autocorrelogram corresponding to a
first audio signal and a second audio signal (or a signal
lagging the first audio signal by a predetermined number),
the autocorrelogram consisting of a plurality of magnitudes,
a method 100 of the preferred embodiment of the
present invention determines a maximum index, that is, the
index corresponding to the largest magnitude in the
autocorrelogram, and calculates a third audio signal
according to the first audio signal, the second audio signal,
the maximum index, a first threshold and a second
threshold. In detail, in order to save time for a digital
signal processing (DSP) chip to synthesize the third audio
signal from the first and second audio signals, the method
100, having determined the maximum index and delayed
the second audio signal by the maximum index, does
not weight and add the whole of the overlapped signal, in
which the first audio signal and the second audio signal are
mixed. Instead, it weights and adds only part of the
overlapped signal (the region between the first threshold and
the second threshold) to the second audio signal
and thereby forms the third audio signal.
[0023] Please refer to Fig. 1, which is a flow chart of a method
100 of the preferred embodiment according to the
present invention. The method 100 comprises the following
steps:

[0024] Step 102: Start;

[0025] (An S3[n] signal is to be synthesized from an S1[n] signal
and an S2[n] signal. For simplicity, the S1[n] and S2[n]
signals are defined to contain N1 and N2 elements
respectively.)

[0026] Step 104: Delaying the S2[n] signal by a predetermined
number A and forming an S5[n] signal;

[0027] (In order to prevent run-in from occurring while a
pickup of an A/V player reads the S3[n] signal, the method
100 delays the S2[n] signal by the predetermined number
A and then determines a maximum index τmax, which is
crucial for the process of synthesizing the S3[n] signal from
the S1[n] signal and the S2[n] signal. In the preferred
embodiment, the predetermined number A is equal to [N/3].)
[0028] Step 106: Establishing an autocorrelogram of the S1[n] and
S5[n] signals and delaying the S5[n] signal to form an S4[n]
signal according to the maximum index τmax
corresponding to a maximum magnitude in the
autocorrelogram;

[0029] (The autocorrelogram comprises a plurality of magnitudes 
of a cross-correlation function, each of the magnitudes 
corresponding to a distinct index.) 

[0030] Step 108: Synthesizing the S3[n] signal from the S1[n]
signal and the S4[n] signal;
[0031] (The S3[n] signal is equal to

[0032] the S1[n] signal, where 0 <= n < (the predetermined
number A + a first threshold value th1 + the maximum index
τmax);

[0033] the S1[n] signal weighted and added to the S4[n] signal,
where (the predetermined number A + the first threshold
value th1 + the maximum index τmax) <= n < (N1 - a second
threshold value th2); and

[0034] the S4[n - (the predetermined number A + the maximum
index τmax)] signal, where (N1 - the second threshold value
th2) <= n <= (N2 + the predetermined number A + the maximum
index τmax);

[0035] wherein the first threshold value th1 and the second
threshold value th2 are not equal to zero at the same time.)
[0036] Step 110: End.
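
Steps 104 to 108 can be sketched in Python under stated assumptions. The weighting used here is a plain linear cross-fade, chosen only as a stand-in because the weighting is not spelled out in the steps above. The final range is read as the S2[n] samples shifted by A + τmax, which is one interpretation of the third case. All function and parameter names (`delay`, `maximum_index`, `synthesize`, `offset_a`, `max_lag`) are hypothetical.

```python
import math

def delay(signal, count):
    """Delay a signal by prepending `count` zero-valued elements."""
    return [0.0] * count + list(signal)

def maximum_index(s1, s5, max_lag):
    """Lag in 0..max_lag with the largest correlogram magnitude."""
    def magnitude(lag):
        # Correlate s1[n] with s5 delayed by `lag`, i.e. s5[n - lag].
        stop = min(len(s1), len(s5) + lag)
        return sum(s1[n] * s5[n - lag] for n in range(lag, stop))
    return max(range(max_lag + 1), key=magnitude)

def synthesize(s1, s2, offset_a, th1, th2, max_lag):
    """Steps 104 to 108, with an assumed linear cross-fade as the weighting."""
    if th1 == 0 and th2 == 0:
        raise ValueError("th1 and th2 must not both be zero")
    s5 = delay(s2, offset_a)                 # step 104
    tau = maximum_index(s1, s5, max_lag)     # step 106
    s4 = delay(s5, tau)                      # s4[n] == s2[n - (offset_a + tau)]
    start = offset_a + th1 + tau
    stop = len(s1) - th2
    if not 0 <= start < stop:
        raise ValueError("thresholds leave no region to weight and add")
    s3 = []
    for n in range(len(s4)):                 # step 108
        if n < start:
            s3.append(s1[n])
        elif n < stop:
            w = (n - start) / (stop - start)  # assumed linear ramp from 0 toward 1
            s3.append((1.0 - w) * s1[n] + w * s4[n])
        else:
            s3.append(s4[n])
    return s3, tau

# Hypothetical input: 60 samples of a sine wave with a period of 8 samples,
# used as both S1[n] and S2[n].
s1 = [math.sin(2 * math.pi * n / 8) for n in range(60)]
s3, tau = synthesize(s1, s1, offset_a=20, th1=5, th2=5, max_lag=12)
```

In this example the maximum index aligns the delayed copy with the phase of the first signal, so the 84-sample output remains a continuous sine wave across the weighted region.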

[0037] Please refer to Fig. 2, which is a schematic diagram
demonstrating how the method 100 synthesizes the S3[n]
signal from the S1[n] and S2[n] signals according to the
present invention. In Fig. 2, a first part 401 shows the S1[n]
and S2[n] signals in the step 102 of the method 100, a
second part 402 shows the S1[n] and S5[n] signals calculated
from the step 104 of the method 100, a third part
403 shows the maximum index τmax
and the S4[n] signal calculated from the step 106 of the
method 100, and a fourth part 404 and a fifth part 405 show
the S3[n] signal synthesized from the S1[n] and the S4[n] signals
in the step 108 of the method 100.
[0038] The S3[n] signal shown in the fourth part 404 of Fig. 2 is
equal to [equation not recoverable from the source], where
(the predetermined number A + the maximum index τmax
+ the first threshold value th1) <= n < (N1 - the second
threshold value th2).

[0039] The S3[n] signal shown in the fifth part 405 of Fig. 2 is
equal to [equation not recoverable from the source], where
(the predetermined number A + the maximum index τmax
+ the first threshold value th1) <= n < (N1 - the second
threshold value th2).
[0040] If the S1[n] signal is the same as the S2[n] signal and both
are derived from the S[n] at an identical region, as shown
in Fig. 3, the method 100 in effect elongates the S1[n] signal.
Conversely, if the S1[n] and S2[n] signals are
different from each other and are derived from the S[n] at
two distinct regions respectively, as shown in Fig. 4, the
method 100 in effect shortens the S1[n], an S6[n] (discarded)
and the S2[n] signals into the S3[n] signal.
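
The two cases can be compared by length alone, as in the following Python sketch. It assumes that the output ends with the last sample of the S2[n] signal after shifting by A + τmax, so it counts A + τmax + N2 samples. The function name and all numeric values are hypothetical.

```python
def output_length(n2, offset_a, tau_max):
    """Samples in S3[n] up to the end of the shifted S2[n] signal."""
    return offset_a + tau_max + n2

# Elongation (Fig. 3): S1[n] and S2[n] are the same 60-sample region of S[n].
elongation_ratio = output_length(60, 20, 4) / 60

# Shortening (Fig. 4): S1[n], a discarded S6[n] and S2[n] are all consumed.
n1, n6, n2 = 60, 30, 60
shortening_ratio = output_length(n2, 20, 4) / (n1 + n6 + n2)
```

With these figures one 60-sample region becomes 84 samples (a ratio of 1.4), while 150 input samples are reduced to 84 (a ratio of 0.56).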
[0041] In contrast to the prior art, the present invention provides
a method to synthesize the S3[n] signal from the S1[n]
and S2[n] signals based on the maximum index corresponding
to the maximum magnitude of the autocorrelogram
and on the first and second threshold values, which confine
the overlapped signal in which the S1[n] and the S2[n]
signals are mixed. Instead of calculating all
values of the overlapped signal from start to finish, the method
calculates only the values between the first threshold and the
second threshold, which saves time for a DSP chip synthesizing
the S3[n] signal from the S1[n] and S2[n] signals and
improves the performance of a computer in which the DSP chip
is installed.
[0042] Following the detailed description of the present invention 
above, those skilled in the art will readily observe that nu- 
merous modifications and alterations of the device may be 
made while retaining the teachings of the invention. Ac- 
cordingly, the above disclosure should be construed as 
limited only by the metes and bounds of the appended 
claims. 



